Deep Reactive Policies for Planning in Stochastic Nonlinear Domains
Similar resources
Learning Reactive Policies for Probabilistic Planning Domains
We present a planning system for selecting policies in probabilistic planning domains. Our system is based on a variant of approximate policy iteration that combines inductive machine learning and simulation to perform policy improvement. Given a planning domain, the system iteratively improves the best policy found so far until no more improvement is observed or a time limit is exceeded. Thoug...
Decomposition Techniques for Planning in Stochastic Domains
This paper is concerned with modeling planning problems involving uncertainty as discrete-time, finite-state stochastic automata. Solving planning problems is reduced to computing policies for Markov decision processes. Classical methods for solving Markov decision processes cannot cope with the size of the state spaces for typical problems encountered in practice. As an alternative, we investiga...
Decomposition Techniques for Planning in Stochastic Domains
This paper is concerned with modeling planning problems involving uncertainty as discrete-time, finite-state stochastic automata. Solving planning problems is reduced to computing policies for Markov decision processes. Classical methods for solving Markov decision processes cannot cope with the size of the state spaces for typical problems encountered in practice ...
Discrepancy Search with Reactive Policies for Planning
We consider a novel use of mostly-correct reactive policies. In classical planning, reactive policy learning approaches could find good policies from solved trajectories of small problems and such policies have been successfully applied to larger problems of the target domains. Often, due to the inductive nature, the learned reactive policies are mostly correct but commit errors on some portion...
Reactive Policies with Planning for Action Languages
We describe a representation in a high-level transition system for policies that express a reactive behavior for the agent. We consider a target decision component that figures out what to do next and an (online) planning capability to compute the plans needed to reach these targets. Our representation allows one to analyze the flow of executing the given reactive policy, and to determine wheth...
Journal
Journal title: Proceedings of the AAAI Conference on Artificial Intelligence
Year: 2019
ISSN: 2374-3468, 2159-5399
DOI: 10.1609/aaai.v33i01.33017530